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ABSTRACT 

Despite the vast excess of cellular RNAs, precisely two copies of viral genomic RNA (gRNA) are selectively packaged into new 
human immunodeficiency type 1 (HIV-1) particles via specific interactions between the HIV-1 Gag and the gRNA psi (\|/) 
packaging signal. Gag consists of the matrix (MA), capsid, nucleocapsid (NC), and p6 domains. Binding of the Gag NC domain 
to v is necessary for gRNA packaging, but the mechanism by which Gag selectively interacts with \|f is unclear. Here, we 
investigate the binding of NC and Gag variants to an RNA derived from \|/ (Psi RNA), as well as to a non-ip region (TARPolyA). 
Binding was measured as a function of salt to obtain the effective charge (Z e ff) and nonelectrostatic (i.e., specific) component 
of binding, K^-, M) . Gag binds to Psi RNA with a dramatically reduced Kd<iM> and lower Z e ff relative to TARPolyA. NC, 
GagAMA, and a dimerization mutant of Gag bind TARPolyA with reduced Z eff relative to WT Gag. Mutations involving the NC 
zinc finger motifs of Gag or changes to the G-rich NC-binding regions of Psi RNA significantly reduce the nonelectrostatic 
component of binding, leading to an increase in Z en -. These results show that Gag interacts with gRNA using different binding 
modes; both the NC and MA domains are bound to RNA in the case of TARPolyA, whereas binding to Psi RNA involves only 
the NC domain. Taken together, these results suggest a novel mechanism for selective gRNA encapsidation. 
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INTRODUCTION 

Amidst the vast excess of cellular RNAs, assembling human 
immunodeficiency virus type 1 (HIV-1) virions selective- 
ly package two copies of the viral RNA genome (gRNA). 
During assembly, viral Gag proteins coalesce at the plasma 
membrane and bud off to form new virions ~100 nm in size 
(for review, see Sundquist and Krausslich 2012). The Gag pre- 
cursor protein consists of the matrix (MA), capsid (CA), spac- 
er peptide 1 (SP1), nucleocapsid (NC), spacer peptide 2 (SP2), 
and p6 domains, which are cleaved into freestanding proteins 
during virus maturation (Coffin et al. 1997; Fields et al. 2007). 
During assembly of immature virions, Gag binds to gRNA via 
NC and to the plasma membrane via the MA domain. In ad- 
dition, the CA domain forms CA-CA contacts critical for as- 
sembly (for review, see Ganser-Pornillos et al. 2012; Sundquist 
and Krausslich 2012). 
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The 5' untranslated region (UTR) of the HIV-1 viral RNA 
contains motifs required for gRNA packaging commonly 
referred to as the Psi (\|/) packaging signal (Lever et al. 1989; 
Wilkinson et al. 2008), which consists of a series of stem-loops 
(SL1-SL4) . Herein \|/ refers to the packaging signal in the virus, 
and Psi RNA refers to the RNA construct containing SL1-SL3 
(Fig. 1), which we have examined in this report. For some ret- 
roviruses, such as murine leukemia virus (MLV) and Rous sar- 
coma virus, the minimal \|/ packaging element is well defined 
(Lu et al. 201 lb, and references therein), but in HIV-1, in ad- 
dition to SL1-SL4, other elements in the 5' UTR contribute ei- 
ther directly or indirectly to packaging (for review, see Lu et al. 
201 lb) (Skripkin et al. 1994; Russell et al. 2003; Parkash et al. 
2012). Some specific cellular RNAs that lack \|/ are also pack- 
aged into virions (Houzet et al. 2007; Kleiman et al. 2010; 
Keene and Telesnitsky 2012), and in the absence of ^-contain- 
ing RNAs, selectively packaged cellular RNA levels increase 
while additional cellular RNAs are also abundantly and non- 
selectively incorporated into virions (Muriaux et al. 2001; 
Rulli et al. 2007). 

Aside from the \\i stem-loops, the 5' UTR of the HIV- 1 ge- 
nome also contains the transactivation response (TAR) stem- 
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FIGURE 1 . Protein and RNA constructs used in this work. (A) The TARPolyA RNA and Psi RNA are derived from the NL4-3 virus isolate. A single G 
residue was appended to the 5' end of TARPolyA, and three G residues were appended to the 5' end of Psi RNA to improve in vitro transcription. Psi 
RNA contains the dimerization initiation site (DIS) shown in red with the residue C258 (green) mutated to G to generate monomeric Psi. The 12 res- 
idues changed to A in the Psi- 1 2M variant are shown in blue. (B) GagAp6 protein constructs examined in this work with full-length Gag ( top) shown for 
comparison. The amino acid sequence of NC is shown with zinc ions chelated by the invariant CCHC motif of each zinc finger. Zinc finger variants 
studied herein are depicted in cartoon form, with single H-to-C point mutations indicated by a red asterisk and with deletions indicated by an X. 



loop critical for viral RNA transcription, the polyadenylation 
site-containing PolyA stem-loop (PolyA), and the primer 
binding site (PBS), which harbors the sequence that anneals 
to primer tRNA Lys3 for reverse transcription initiation (Fig. 
1A). Within \|/, SL1 contains the dimerization initiation site 
(DIS), a palindromic sequence (GCGCGC in the NL4-3 iso- 
late) responsible for dimerization of the two copies of pack- 
aged gRNA (Skripkin et aL 1994; for review, see Moore and 
Hu 2009; Lu et al. 2011b). SL2 contains the splice donor 
site, critical for splicing of the viral RNA for translation of viral 
accessory proteins, and SL3 is a sequence shown to contribute 
to gRNA packaging (Clever and Parslow 1997; McBride and 
Panganiban 1997; Russell et al. 2003) and high-affinity NC 
binding (De Guzman et al. 1998; Athavale et al. 2010). In 
HIV- 1 , genome packaging and dimerization are hypothesized 
to be coordinated processes, as virions contain only dimeric 
gRNA (Moore and Hu 2009; Nikolaitchik et al. 20 1 3 ) . A model 
for the switch between translation and packaging has been 
proposed based on NMR spectroscopy data showing that the 
DIS loop of SL1 folds back and interacts with an upstream re- 
gion in the viral RNA, thereby preventing dimerization of SL 1 
and promoting translation (Lu et al. 201 la). In MLV, dimeri- 
zation of the gRNA exposes high- affinity binding sites for the 
NC domain of Gag (D'Souza and Summers 2004; Gherghe 
et al. 2010; Miyazaki et al. 2010), suggesting that packaging 
and Gag binding to \|/ are coordinated with dimerization. In 
HIV-1, a 159-nt "core encapsidation signal" was recently de- 



scribed that binds NC with similar affinity as the full-length 
5' leader sequence (Heng et al. 2012). However, there may 
be important differences in the mechanisms by which the 
discrete NC domain and the full-length Gag protein bind to 
gRNA with high affinity. 

Recent reports have shown that HIV- 1 Gag interacts with 
nucleic acids via both the MA and NC domains. MA interac- 
tions with RNA are secondary to its binding to phosphatidyli- 
nositol-(4, 5)-bisphosphate (PIP 2 )-containing lipids once it 
reaches the plasma membrane, whereas NC prefers to bind 
to RNA (Alfadhli et al. 2009; Chukkapalli et al. 2010; Jones 
et al. 2011). MA is myristoylated and also contains a basic 
patch, which enhances its binding to PIP 2 (Chukkapalli 
et al. 2008, 2010). NC is a highly basic protein containing 
two CCHC zinc fingers (Fig. 1 ) that are critical for RNA-bind- 
ing specificity, nucleic acid chaperone activity (Levin et al. 
2010; Darlix et al. 2011), and gRNA packaging (Gorelick 
et al. 1999b; Kafaie et al. 2008). Investigations into the RNA- 
binding specificity of Gag have previously studied the binding 
of the NC protein to RNA and DNA (Dannull et al. 1994; 
Fisher et al. 1998, 2006; Vuilleumier et al. 1999; Avilov et al. 
2009; Athavale et al. 2010), and more recently the binding of 
Gag to short oligonucleotides has also been investigated 
(Stephen et al. 2007; Wu et al. 2010; Jones et al. 2011). In 
this study, we investigate the binding of Gag and NC to longer 
~ 100-nt RNAs derived from the gRNA 5' UTR. To better un- 
derstand how Gag-\[/ binding contributes to specific gRNA 
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packaging, two model RNAs were chosen: Psi RNA, which 
contains SL1-SL3, and a non-Psi construct containing the 
TAR and PolyA stem-loops (TARPolyA) (Fig. 1). We used a 
salt titration approach, which allowed determination of both 
the electrostatic and nonelectrostatic contributions to bind- 
ing. Our findings suggest that zinc finger-dependent Gag-y 
interaction results in a specific binding mode that facilitates 
selective packaging of gRNA. 

RESULTS 

Direct binding assays show salt-resistant binding 
of Gag to Psi RNA 

To examine the interactions between Gag and RNAs derived 
from the HIV-1 genome, we first performed direct bind- 
ing measurements in which either fluorescently labeled 
TARPolyA or Psi RNA (Fig. 1) was incubated with various 
amounts of Gag or NC protein. The recombinant GagAp6 
protein (referred to as WT Gag or Gag) used herein is not 
myristoylated and lacks the C-terminal p6 domain (Fig. 1). 
Binding was detected via fluorescence anisotropy (FA) using 
fluorescently labeled RNAs, and assays were performed at var- 
ious [NaCl] concentrations (50-500 mM). Although binding 
affinities of Gag to Psi RNA and TARPolyA were similar at 50 
mM NaCl {K d ~ 50 nM), at 0.5 M NaCl, Gag still binds Psi 
RNA (_K"d = 300 nM), whereas no binding was observed to 
TARPolyA (Fig. 2A,B). NC showed robust binding to Psi 
RNA at 50 mM and 150 mM NaCl (K d = 50 nM and K d = 74 
nM, respectively) and weaker binding at 0.5 M NaCl {K d > 
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FIGURE 2. Direct binding assays using fluorescence anisotropy equilibrium measurements. 
Binding of GagAp6 to (A) Psi RNA and {B) TARPolyA at various NaCl concentrations are shown. 
Binding of NC to (C) Psi RNA and (D) TARPolyA at various NaCl concentrations are shown. 



2000 nM) (Fig. 2C). NC binding to TARPolyA was even 
more salt sensitive with very weak binding observed at 150 
mM NaCl (Fig. 2D). These assays showed a dramatic diffe- 
rence in the salt dependence of Gag and NC binding to \\i vs. 
non-\|/ RNAs. The apparent salt independence of iCj's under 
low-salt conditions is an artifact of the RNA concentration 
used (~30-40 nM), which was similar to the K d values de- 
termined. To overcome this technical issue and to obtain ad- 
ditional insights into the mechanism of binding, a salt titration 
assay was used. 

Gag binds Psi RNA with high affinity and specificity 

To gain additional insights into t|/ vs. non-\|/ RNA binding 
of Gag and NC, a salt titration assay was performed in which 
a fixed concentration of Gag or NC protein was bound to 
RNA (~30 nM) and FA was measured as a function of 
[NaCl]. In this experiment, as [NaCl] increases over a range 
of from 50 mM to 1 M, Gag was gradually displaced from 
the labeled RNA (either Psi or TARPolyA), leading to a 
decrease in the FA signal. A protein concentration of 400 
nM was chosen based on the direct binding experiments 
(Fig. 2); however, similar results were obtained when experi- 
ments were performed at 750 nM protein (Supplemental 
Information; Supplemental Fig. S 1 ) . Figure 3 shows the results 
of salt titration assays for Gag (Fig. 3A) and NC (Fig. 3B) bind- 
ing to TARPolyA and Psi. 

Dramatic differences in the salt dependence of binding 
to each RNA were observed. Data from salt titrations were fit 
to determine the nonelectrostatic compo- 
nent of binding (i.e., protein-RNA disso- 
ciation constant at 1 M salt, -Kd(iM)) an< ^ 
the effective charge (Z e ff) of the protein 
interface involved in direct RNA binding 
as described in the Supplemental Infor- 
mation. The fits were graphically repre- 
sented by plotting K d vs. [NaCl] (Fig. 
3C), and the iQ(iM) an d Z e g values, 
calculated according to equation 1 (see 
Materials and Methods), are reported in 
Table 1 . Because the salt effect on cationic 
protein binding is largely insensitive to 
the identity of the cation, given the same 
net positive charge (i.e., Na + vs. K + or 
Mg 2+ vs. Ca 2+ ) (Rouzina and Bloomfield 
1997; Vo et al. 2006; Athavale et al. 
2010), using this analysis iQ maybe pre- 
dicted for a particular protein-RNA pair 
under different buffer conditions. Kd(iM) 
reflects the binding under conditions in 
which all electrostatic interactions are 
screened out by salt, and the only contri- 
bution to binding comes from specific 
contacts such as aromatic residue stack- 
ing with unpaired nucleic acid bases or 
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FIGURE 3. Salt titration of Gag and NC binding to Psi RNA and 
TARPolyA. (A) Prebound Gag-Psi RNA or Gag-TARPolyA complexes 
were titrated with increasing amounts of NaCI, leading to a decrease in 
the anisotropy. (B) A similar set of titrations was performed with NC- 
Psi RNA and NC-TARPolyA complexes. (C) Data from A and B are 
regraphed in a log-log plot showing the dependence of the apparent 
binding affinity (K^) on NaCI concentration according to 
Kd = ^d(iM) ' [Na] Zdf . The slope of each line is the Z^s value and the val- 
ue of the Jfj at 1 M NaCI is the ifd(iM)- The Z e ff and -fCd(iM) parameters 
were obtained by fitting the experimental salt titration data. A complete 
description of this method and subsequent analysis are available in the 
Materials and Methods and the Supplemental Methods. 

specific hydrogen bonds. Moreover, the measured bind- 
ing is to the highest affinity RNA site, as discussed in the 
Supplemental Information. Z e g is a measure of the number 
of Na + ions displaced from RNA upon protein binding and re- 
flects the number of the positively charged groups at the pro- 
tein surface directly interacting with the RNA phosphates 
(Record et al. 1976; Rouzina and Bloomfield 1997). In solu- 
tion, ~70% and ~90% of the phosphate backbone is occupied 
by Na + ions for ssDNA and dsDNA, respectively (Record et al. 
1976). Because the Na + occupancy of the structured RNAs 
studied here (Fig. 1) likely falls between these two numbers, 
our reported Z e ff values have not been normalized to account 
for partial occupancy. We ensured that the Z e s and iCj ( 1M) val- 
ues calculated were insensitive to the number of protein-bind- 



ing sites on each RNA (Supplemental Table SI ). In addition to 
the fitted parameters Z e g and iCj( 1M) , we also determined AA, 
the total change in anisotropy upon protein binding. This val- 
ue depends on the change in size between free and bound 
RNA/protein complex and is distinct for every protein-RNA 
pair. Finally, for each protein/RNA interaction, we also report 
Nai/2 values, which are the titration midpoints, or the amount 
of NaCI required to displace 50% of bound protein. The Na 1/2 
and Anisotropy vs. [Na] slope at Nai/ 2 are related to bind- 
ing parameters Z eff , K d{1M) , and AA, as discussed in the 
Supplemental Information, and described by equations S12 
and SI 3, respectively. 

NC binding to TARPolyA (Fig. 3B) was characterized by a 
K d(1M ) of 2.3 x 10" 3 M, which is ~100-fold weaker than NC 
binding to Psi RNA (K d{lM) ~ 2.6 x 10" 5 M). This finding is 
in agreement with previous studies showing ~ 1 00-fold affinity 
range for the 1M) values for NC binding to 6-nt ssDNA oli- 
gos ( Vuilleumier et al. 1 999 ) . NC/TARPolyA and NC/Psi RNA 
interactions were both characterized by a Z e ff of ~3-4, in 
agreement with previous studies (Vuilleumier et al. 1999; Vo 
et al. 2009a; Athavale et al. 2010). Furthermore, the nonelec- 
trostatic binding component of Gag, iCj(iM) was even more 
selective for Psi RNA relative to TARPolyA (~ 1000-fold) 
than NC (Table 1). Moreover, Gag bound to TARPolyA 
with an approximately twofold greater Zgg- of 9 relative to Psi 
RNA (Zgff = 5) (Table 1) — suggesting that Gag interacts with 
TARPolyA with four additional positive charges relative to 
Psi RNA. As shown in Figure 2, A and B, to displace Gag 
from TARPolyA or Gag from Psi, a greater amount of NaCI 
was required than for the corresponding NC interactions. 
This finding is reflected by higher Na 1/2 values for Gag/RNA 
interactions (Table 1 ) and suggests that Gag domains outside 
of NC contribute to binding. Importantly, at physiological 
NaCI (~150 mM), the estimated i<T d 's for Gag binding to 
TARPolyA RNA and Psi RNA differ by approximately sixfold. 
Surprisingly, Gag binds to TARPolyA with a higher affinity 
than to Psi RNA, suggesting that selective packaging of Psi- 
containing RNAs is unlikely to be due to stronger equilibrium 
binding affinity. In summary, Gag binds Psi RNA with re- 
duced charge interactions and much stronger nonelectrostatic 
interactions relative to TARPolyA binding. 

Both MA and NC domains contribute 
to Gag/TARPolyA binding 

To dissect the difference between Psi and TARPolyA bind- 
ing at a molecular level, salt titration assays were performed 
with Gag variants CANC and W316A,M317A-GagAp6 
(WM-Gag). The latter variant contains changes in CA residues 
that are critical for Gag-Gag dimerization (Datta et al. 2007). 
Binding of either of these proteins to TARPolyA or Psi RNA 
is characterized by Z e ff values of ~4-5, similar to the value ob- 
tained for WT Gag binding to Psi RNA (Table 1 ; Supplemental 
Fig. S2 ) . Thus, deletion of the MA domain removes additional 
positive charges involved in TARPolyA binding, but not Psi 
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TABLE 1. Binding 


; parameters for GagAp6 variants 






Psi 


GagAp6 Variant 


AA a 


Na 1/2 (mM) b 


Kd(iM) (M)C 




WT 


0.1 5 ±0.006 


372 ±3 


(5.2±1)x10~ 5 


5.0 ±0.2 


WT (750 nM) 


0.1 6 ±0.008 


439 ±7 


(6.3 ± 3) x 10~ 5 


5.6±0.5 


CANC 


0.094 ±0.001 


188 ±4 


(2.5 ± 1) x 10~ 4 


4.0 ±0.3 


NC 


0.087 ±0.003 


272 ±6 


(2.6±0.8)x10~ 5 


3.3 ±0.2 


WM 


0.1 2 ±0.003 


305 ±5 


(1.0±0.4)x10~ 4 


4.9 ±0.3 


3M 


0.1 5 ±0.006 


551 ±9 


(1.3 + 0.6)x10~ 5 


6.3 ±0.6 


K30,32N 


0.14±0.002 


325±4 


(3.2±0.7)x10~ 5 


4.1 ±0.2 


AZF1 


0.14±0.009 


371 ±4 


(3.9±1)x10~ 5 


5.0 ±0.2 


AZF2 


0.1 5 ±0.02 


208 ±2 


(1.6±2)x 10~ 2 


7.7±0.5 


AZF1 +2 


0.1 3 ±0.03 


206 ±2 


(1.9±2)x 10~ 1 


8.7±0.5 


H400C 


0.1 4 ±0.0006 


294 ±3 


(6.2 ± 6) x 1 0~ 3 


7.9 ±0.7 


H421C 


0.1 6 ±0.02 


225 ±2 


(7.5 ± 5) x 10~ 4 


6.3 ±0.4 


H400,421C 


0.14±0.02 


202 ±2 


(5.4 + 9) x10~ 1 


9.8±0.7 


TARPolyA 


WT 


0.1 6 ±0.008 


238 ±1 


(2.2±1)x10~ 2 


9.1 ±0.3 


WT (750 nM) 


0.1 5 ±0.02 


248 ±1 


(4.4 + 4)x10~ 1 


10 + 0.5 


CANC 


0.14±0.008 


130±2 


(3.9 ± 2) x 10" 3 


4.7±0.3 


NC 


0.049 ±0.003 


122 + 3 


(2.3±1)x10~ 3 


3.9 ±0.3 


WM 


0.1 5 ±0.004 


180 ±3 


(1 .9 ± 1 ) x 1 0~ 3 


5.0±0.3 


3M 


0.1 6 ±0.001 


308 ±1 


(4.8 ± 3) x 10" 3 


8.8±0.5 


K30,32N 


0.14±0.007 


198 + 1 


(7.0±1)x10~ 3 


8.0 ±0.4 


AZF1 


0.1 6 ±0.007 


229 ±1 


(6.9 + 6)x10~ 1 


10 + 0.5 


AZF2 


0.1 7± 0.006 


193 + 1 


3.4 ±4 


10 + 0.5 


AZF1 +2 


0.1 6 ±0.004 


202 ±1 


1.7 + 2 


10±0.7 


H400C 


0.1 3 ±0.003 


205 ±1 


(1 .4 ± 0.9} x 1 0 _1 


8.3 ±0.4 


H421C 


0.1 7± 0.005 


212±1 


6.1 ±10 


13 ±0.8 


H400,421C 


0.1 6 ±0.006 


1 97 ± 1 


8.0 ±20 


11 ±1 


C258G Psi 


WT 


0.14±0.001 


331 ±4 


(6.3 ± 2) x 10" 5 


4.7±0.2 


NC 


0.071 ±0.006 


246 ±9 


(3.3 ± 2) x 10~ 5 


3.3 ±0.4 


AZF1 +2 


0.1 4 ±0.004 


227±3 


(1.3 + 2)x10~ 1 


8.8±0.8 


Psi-12M 


WT 


0.1 6 ±0.001 


322 ±5 


(1 .2 ± 0.4) x 1 0~ 3 


7.1 ±0.3 


All experiments were conducted in the presence of 30-40 nM fluorescent olis 


omer, 400 


nM protein unless otherwise indicated, 1 


mM MgCI 2 , 


20 mM HEPES, 10 uM TCEP, 5 mM 


BME, and varying 


; NaCI concentrations (50 mM-1 M) 


. All values represent the 


average of 


three or more trials with the associated standard deviation. 




a AA is the maximum change in anisotropy observed upon protein binding. 




Na-i/2 is the concentration of NaCI at wh 


ich the half anisotropy value is reached. 


c Kd(iM) is the affin 


ity of the specific, nonelectrostatic component of binding. 




Z e ff is the effective charge of the protein 


during its bi 


nding interaction with the RNA and 


also reflects the number of NaCI ions that are displaced during binding. 





RNA binding. Interestingly, the-K^iM) for TARPolyA binding 
by CANC was the same as for NC, and this value is ~ 10-fold 
smaller compared with WT Gag, suggesting that removal of 
MA allows optimization of nonelectrostatic NC interactions 
with RNA. The similarity between CANC and NC in binding 
to TARPolyA suggests that CA-CA interactions only play a mi- 
nor role in Gag-RNA binding. Moreover, this observation 
excludes the possibility that the Z e g of ~9 measured for 
Gag binding to TARPolyA is due to dimerization of Gag. 



Consistent with the notion of weak Gag- 
RNA binding cooperativity was also the 
observation that Psi RNA binds to Gag, 
CANC, and NC with similar Z e ff values 
and with K^cim) values, differing only by 
approximately twofold for Gag and NC. 

reflects all ions displaced upon nucleic 
acid binding — ions displaced by direct 
contacts between the RNA and protein 
and those displaced by protein-protein 
contacts occurring upon RNA binding. 
Because the difference between Gag and 
NC binding comes from additional CA- 
CA interactions in Gag, we conclude that 
the CA-CA dimer interactions are weak 
and primarily nonelectrostatic, consistent 
with their primarily hydrophobic nature 
(for review, see Ganser-Pornillos et al. 
2012). 

It was surprising to find that eliminat- 
ing dimerization leads to a major change 
in Gag binding to TARPolyA, as men- 
tioned above; WM-Gag has a significantly 
lower Z e ff (~5) compared with WT Gag 
(~9). Thus, eliminating Gag dimerization 
has a similar effect on non-Psi RNA bind- 
ing as deleting MA. Indeed, WM-Gag and 
CANC appeared very similar in their 
RNA-binding properties (Supplemental 
Fig. S2). Both proteins bound Psi and 
TARPolyA with Z e ff ~5, and their Xd(iM) 
values were ~ 10-fold smaller to Psi RNA 
than to TARPolyA, reflecting the RNA se- 
quence specificity of the NC domain. The 
reduction in positive charges interacting 
with TARPolyA in the case of WM-Gag 
suggests that the loss of Gag dimerization 
favors an NC-only binding mode. 

To further test the contribution of 
Gag's MA domain to TARPolyA and Psi 
binding, we examined two previously de- 
scribed Gag variants (Jones et al. 2011) in 
which the MA domain is mutated to be ei- 
ther more positively charged (E40R,E42L, 
N47K GagAp6, or Gag-3M) or more neu- 
tral (K30,32N GagAp6). Binding to both 
RNAs by Gag-3M was characterized by higher Na 1/2 values 
compared with any of the other proteins tested (551 mM 
for Psi RNA and 308 mM for TARPolyA) (Table 1; Supple- 
mental Fig. S3). The -Kd(iM) was reduced for Psi RNA binding 
(approximately fourfold) and TARPolyA binding (approxi- 
mately fivefold) (Table 1), reflecting additional nonelectro- 
static RNA contacts with Gag-3M, possibly due to the E42L 
mutation. Relative to WT Gag, the Z e g is increased for Gag- 
3M binding to Psi RNA (5 vs. 6.3), consistent with a 
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contribution to electrostatic binding from the more positive 
MA domain in this variant. In contrast, Gag-3M bound to 
TARPolyA with an effective charge similar to WT Gag, sug- 
gesting that the additional charged residues in the MA domain 
of Gag-3M do not contribute to binding to TARPolyA. 

Binding by Gag-K30,32N to both RNAs was characterized 
by a slightly lower Na 1/2 value relative to WT Gag, but greater 
relative to CANC (Table 1; Supplemental Fig. S3). The effects 
of the neutralizing mutations also reduce the effective charge 
of binding to TARPolyA (Z e ff ~8), which falls between the 
value for WT Gag and CANC {Z e g = 9.1 and 4.7, respective- 
ly). Nonelectrostatic binding by Gag-K30,32N to TARPolyA 
was largely unchanged relative to WT Gag, which would be 
expected, as the deletion of the MA domain only reduced 
the Kd(iM) by approximately sixfold. 

Taken together, the results of our studies of MA mutants 
suggest that both the MA and NC domains are involved 
in TARPolyA binding, but MA does not contribute to Psi 
RNA binding, as shown by lower Z e g values in MA variants. 
However, changes to -Kd(iM) m these Gag variants are mod- 
est when compared with the zinc finger variants described 
below. 



The zinc fingers in Gag are required for high-affinity 
Psi RNA binding 

Previous studies have shown that NC binds RNAs in a largely 
nonspecific manner via electrostatic interactions between the 
nucleic acid backbone and the basic residues in the protein 
(Levin et al. 2005; Fisher et al. 2006; Vo 
et al. 2009a; Darlix et al. 2011; Wu et al. 
2012). However, binding to certain 
RNA motifs, especially single-stranded 
UG sequences (Fisher et al. 1998), or ex- 
posed G residues in loop regions such as 
SL3 (De Guzman et al. 1998), involves 
high-affinity binding in which the zinc 
finger (ZF) aromatic residues stack with 
the G base in a hydrophobic pocket 
formed between the two ZFs (De 
Guzman et al. 1998; Darlix et al. 2011). 
To examine the contribution of the ZFs 
to Psi RNA and TARPolyA binding, we 
examined single and double-mutant 
Gag variants, in which either one or 
both critical ZF His residues were mutat- 
ed to Cys (Fig. 1). These H-to-C muta- 
tions do not disrupt the tetrahedrally 
chelated zinc ion, but alter the zinc finger 
fold (Julian et al. 1993), and both viral in- 
fectivity and RNA packaging are severely 
reduced in the double H-to-C mutant 
(Gorelick et al. 1999a; Kafaie et al. 
2008). We also examined ZF deletion 
variants in which either one or both 




ZFs were deleted (Fig. 1). These more severe changes also re- 
move basic and aromatic residues within the ZFs and elicit 
defects in gRNA packaging (Houzet et al. 2008; Kafaie et al. 
2008). 

For proteins in which both zinc fingers were disrupted, 
nonelectrostatic binding to Psi was severely reduced (Fig. 
4; Table 1), with a K d(1M) ~10 _1 M for AZF1+2 and 
H400,421C, values that were dramatically higher (~10,000- 
fold) than WT Gag. Thus, intact native ZFs are required for 
high-affinity nonelectrostatic binding. Importantly, in the 
double ZF mutants, Z e g values of ~9-10 were obtained 
for Psi RNA binding, suggesting that the loss of binding spec- 
ificity causes Gag to bind Psi RNA with a larger positive in- 
terface likely involving MA. Alternatively, loss of the zinc 
finger structure could allow for more favorable interactions 
between RNA and the basic residues in NC that would not 
be available for binding with intact zinc fingers, as suggested 
by others (Hargittai et al. 2004). Single ZF mutation variants 
were characterized by less severe binding defects with Z e ff 
values between 5 and 9, and the -KauM) va l ues ~10- to 100- 
fold higher than for the WT Gag (Table 1). An exception 
was AZF1, which had a Kd(iM) value similar to WT Gag. 
Interestingly, for ZF variants higher Z e ff values typically corre- 
late with higher J^( 1M ) values, indicating that the loss of 
binding specificity is commensurate with more positive char- 
ges of Gag directly contacting the RNAs. In viruses containing 
zinc finger deletion variants, gRNA packaging is reduced ~ 10- 
fold for AZF1 and AZF2 and ~ 100-fold for AZF1 + 2 (Houzet 
et al. 2008). 
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FIGURE 4. Salt titrations in the presence of zinc finger variant AZF1 + 2. WT GagAp6 is also 
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Gag binding to TARPolyA was also affected by ZF mu- 
tation, albeit to a lesser extent (Fig. 4; Table 1). ZF variants 
bound TARPolyA with ~10 _1 to 1 M K d( im), which was re- 
duced ~10- to 100-fold compared with WT Gag. Thus, 
TARPolyA binding by Gag also has a nonelectrostatic com- 
ponent associated with the ZFs of NC. Z e g was also increased 
slightly for ZF variants to ~10— 13 for ZF variants, compared 
with 9 for WT Gag. Therefore, as for Psi binding, the loss of 
nonelectrostatic binding for these proteins is accompanied by 
optimization of electrostatic contacts. 

Role of Psi RNA dimerization and single-stranded 
G bases in high-affinity Gag binding 

We next tested the contribution of Psi RNA dimerization to 
binding by introducing the C258G mutation to the DIS 
loop in SL1 of Psi RNA (Fig. 1A). This mutation reduces 
dimerization in virions by ~50% (Shen et al. 2000). Native 
gel analysis showed minimal dimer formation by the 
C258G variant over the range of salt (Supplemental Fig. 
S4A) and RNA concentrations (Supplemental Fig. S4B) 
used in this work. In contrast, WT Psi RNA readily dimerized 
under the same conditions (Supplemental Fig. S4). The pa- 
rameters obtained for Psi C258G RNA binding to Gag, NC, 
and AZF1 +2 were the same, within error, as binding to 
WT Psi RNA (Table 1; Supplemental Fig. S5), suggesting 
that Psi RNA dimerization is not essential for high specificity 
of Gag binding to the Psi RNA construct examined here. 
According to one model (Lu et al. 201 la), a structural switch 
in the RNA coordinates dimerization and packaging, and Gag 
binding may possibly induce this switch. Noting that the 
C258G mutant bound identically to WT Psi RNA, there ap- 
pears to be no preference for Gag to bind to dimerized RNA 
using this in vitro assay. However, Gag binding to full-length 
monomeric and dimeric vRNA could be altered by additional 
sequence elements not contained within our construct 
(Chamanian et al. 2013). 

We next investigated whether there were additional se- 
quence features of Psi that enhance Gag binding. We in- 
troduced 12 point mutations to Psi (Psi-12M) to remove 
single-stranded G bases in loops and bulges that have been 
proposed to be high-affinity NC binding sites based on 
SHAPE footprinting (Wilkinson et al. 2008). GagAp6 binding 
to Psi- 12M was characterized by an ~25-fold weaker nonelec- 
trostatic (i.e., specific) binding (-Kd( 1M ) ~ 1.2 x 10~ 3 ) (Table 
1) relative to WT Psi. This interaction is ~ 18-fold stronger 
than binding to TARPolyA, suggesting that some Gag-Psi 
RNA binding specificity resides in interactions beyond those 
that occur with the unpaired G bases. Additionally, the Z eff 
of binding to Psi-12M was ~7, which falls between the Z e g ob- 
tained for WT GagAp6 binding to Psi (~5) and TARPolyA 
(~9). Comparison of Gag binding to Psi and Psi-12M in 
Supplemental Figure S6 suggests that mutation of sites within 
Psi leads to the loss of most, but not all, of the Gag-Psi RNA 
binding specificity, implying that additional elements of 



primary and/or secondary structure contribute to Psi RNA 
recognition. 

DISCUSSION 

In this study we examined properties of recombinant HIV- 1 
Gag and NC binding to ~100 nt RNAs derived from the 5' 
UTR using an FA-based salt titration method to extract elec- 
trostatic and nonelectrostatic components of the strongest 
protein-RNA binding interaction. Surprisingly, we found 
that Gag bound Psi RNA and non-Psi RNA (i.e., TARPolyA) 
with markedly different biophysical characteristics. Psi RNA 
binding is optimized for specific, salt-independent interac- 
tions, and TARPolyA binding maximizes the total number 
of positive Gag charges binding to RNA at the expense of spe- 
cific contacts. This finding is consistent with the model that 
HIV- 1 Gag is a highly flexible polyprotein with two cationic 
RNA-binding sites in NC and MA domains (for review, see 
Rein et al. 201 1). Specifically, Gag interacts with an RNA de- 
rived from the \|/ portion of the genome with a Z e g and a 
^d(iM) that are twofold and 100-fold lower than the values 
measured for TARPolyA binding. The higher Z e g observed 
in Gag TARPolyA binding suggests that an additional cationic 
binding interface in the MA domain is involved. The data also 
suggest that MA plays a role in binding TARPolyA, but not Psi 
RNA, which is consistent with the finding that CANC, a MA 
deletion variant, binds TARPolyA with a Z e g similar to that 
of NC (~4) (Table 1). Surprisingly, the monomeric WM- 
Gag variant also bound to TARPolyA with reduced Z e s, sug- 
gesting that CA dimerization facilitates simultaneous NC 
and MA binding to non-Psi RNA. 

The zinc fingers of NC are the source of Gag's binding 
specificity to \|/, as shown by biochemical assays (Dannull et 
al. 1994), cell-based assays (Lever et al. 1989; Gorelick et al. 
1990), high-resolution structures (De Guzman et al. 1998; 
Amarasinghe et al. 2000), and experiments described herein 
(Fig. 4). Binding of double ZF variants to Psi RNA is char- 
acterized by a Kd(iM) that is 10,000-fold greater than the value 
measured for WT Gag. Interestingly, the parameters de- 
termined for the ZF variants binding to Psi RNA are similar 
to those of WT Gag binding to TARPolyA (Table 1), strongly 
supporting the conclusion that the ZF structures are criti- 
cal for binding specificity. In the NMR structures of NC 
bound to SL3, G320 of the SL3 tetraloop (Fig. 1) interacts 
with the first ZF of NC via residues Vall3, Phel6, Ile24, 
and Ala25, and G318 of the SL3 tetraloop contacts the sec- 
ond zinc finger via residues Trp37, Gln45, and Met46 (De 
Guzman et al. 1998). These primarily nonelectrostatic 
contacts of Gag's zinc fingers with Psi RNA are likely respon- 
sible for the ~ 1000-fold smaller value of Xd(iM) measured for 
Gag binding to Psi RNA relative to TARPolyA. Likewise, 
the much higher K^ iM -) observed for binding of Psi-12M 
RNA to WT Gag suggests that removal of single- stranded 
G residues from Psi RNA largely eliminates this RNA's 



1084 RNA, Vol. 19, No. 8 



Distinct selective HIV Gag/Psi binding 



enhanced binding specificity (Supplemental Fig. S6). Gag also 
bound Psi-12M with a higher Z e g, suggesting involvement of 
MA in addition to NC, a binding mode that resembles binding 
to TARPolyA. 

The TAR stem-loop has a well- documented role in bind- 
ing to the HIV- 1 Tat protein, which stimulates transcription 
of the full-length viral RNA (Ott et al. 201 1). In addition, NC 
binding to TAR RNA and its role in destabilizing the hairpin 
during the minus-strand transfer step of reverse transcrip- 
tion is well established (Kanevsky et al. 2005, 2011; Vo et 
al. 2009b; Levin et al. 2010; Heng et al. 2012). In an HIV-1 
variant that does not depend on Tat-TAR interaction for 
transcription activation, TAR is not required for gRNA pack- 
aging (Das et al. 2007), but TAR destabilization leads to ab- 
errant RNA dimerization and packaging (Das et al. 2012). 
Thus, although Gag-TAR interactions are not important 
for gRNA packaging, they likely occur during virus assembly 
(for review, see Lu et al. 201 lb). One recent study investigat- 
ing which portions of the 5' UTR are essential for gRNA 
packaging found TAR to be dispensable for genome packag- 
ing, whereas deletion of the PolyA stem caused an approxi- 
mately threefold decrease in packaged gRNA (Didierlaurent 
et al. 2011). This finding maybe due, in part, to a long-range 
interaction proposed to occur between the PolyA stem-loop 
and a downstream region in the viral RNA (Paillart et al. 
2002), which, if disrupted, leads to misfolding of the RNA. 
Compared with Psi RNA, TARPolyA binding to Gag and 
NC is characterized by a much higher -ST d(1M ) for both pro- 
teins and higher Z e ff in the case of Gag (Table 1). However, 
Gag's affinity for TARPolyA is approximately sixfold stronger 
than for Psi RNA at physiological ionic strength (~150 niM 
NaCl) (Fig. 3C). Thus, binding affinity alone is unlikely to ex- 
plain selective viral RNA packaging. 

Live cell microscopy studies of HIV- 1 assembly suggest that 
the number of Gag molecules selecting the gRNA in the cyto- 
plasm is relatively small (i.e., <10) (for review, see Jouvenet 
et al. 2011). These studies also show that assembly of HIV-1 
virus-like particles (VLPs) is nucleated at the plasma mem- 
brane, where a small number of Gag molecules are responsible 
for anchoring viral RNA in membranes. Interestingly, viral 
genomes are not retained at the plasma membrane when their 
packaging signals were mutated (Jouvenet et al. 2009). 
Moreover, in cells where HIV- 1 genomic RNA was available 
for packaging, ~85% of the observed VLP assembly events re- 
sulted in the encapsidation of gRNA. Although similar assem- 
bly times were measured in the presence or absence of gRNA, 
technical limitations did not allow visualization of less than 
~ 10 Gag molecules, thereby precluding observation of the ini- 
tial Gag-gRNA association kinetics. Thus, the high probabil- 
ity of gRNA packaging implies that the packaging selectivity 
likely arises from the binding of the first few Gag molecules 
to gRNA and is consistent with the hypothesis that specific 
Gag-gRNA complexes display faster early assembly kinetics. 

Based on our results, we hypothesize that the conformation 
of the first few Gag molecules strongly bound to \\i RNA via 




FIGURE 5. Model for selection and packaging of gRNA by the first few 
binding Gag molecules. Non-\]/ binding is characterized by Gag binding 
in a NC- and MA-bound conformation, but Gag binds y in an NC-only 
binding mode. The NC-only mode leaves MA free to interact with the 
membrane and has a kinetic advantage over complexes in which MA 
is bound to nucleic acids. 



zinc finger-specific nonelectrostatic interactions differentiates 
this complex from Gag bound to ribonucleoprotein complex- 
es containing cellular RNAs (Fig. 5). Whereas \|/-RNA binding 
involves only the NC domain, non-\|/ binding is characterized 
by electrostatic interactions with both NC and MA, while the 
nonelectrostatic contacts are reduced. Thus, although the MA 
domain binds to the plasma membrane irrespective of the 
identity of the RNA bound to Gag, we propose that Gag bound 
to RNAs that lack v|/ are kinetically hindered in binding to the 
plasma membrane in comparison to \|/-containing complexes 
(Fig. 5). In non-\|/ complexes, MA is partially occupied by nu- 
cleic acid binding via the same interface used for PIP 2 binding 
and, therefore, these complexes have a kinetic barrier to plas- 
ma membrane binding that involves MA-RNA dissociation 
(Shkriabai et al. 2006; Chukkapalli et al. 2010, 2013; Datta 
et al. 2011; Jones et al. 2011). We propose that the availability 
of the MA domain for plasma membrane binding in the Gag 
molecules bound to \|/-containing gRNA could confer a kinet- 
ic advantage for gRNA-Gag complexes to initiate assembly 
(Fig. 5). 

In summary, in this work we have explored the effect of 
multiple mutations and deletions in Gag on the specificity 
of binding to Psi RNA vs. TARPolyA RNA. We show that 
Gag can bind to different RNA molecules with distinct bind- 
ing modes and these mechanistic differences may have im- 
portant implications for gRNA packaging. Importantly, this 
mechanism accounts for selective \|/ RNA packaging even 
when the binding affinities of the competing protein-RNA 
complexes at physiological salt are similar. Our data also sug- 
gest that the zinc finger structures in the Gag NC domain and 
interactions with single-stranded G-rich motifs in \\i confer 
specific Gag binding. Future salt titration studies of Gag bind- 
ing to additional Psi RNA mutants, as well as other vRNA- 
derived and unrelated RNA molecules, will further explore 
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the details of the interactions responsible for Gag's binding 
specificity. 

MATERIALS AND METHODS 

Preparation of proteins and nucleic acids 

All Gag proteins used in this study lack the p6 domain. WT GagAp6 
and variants were prepared as previously described (Datta and Rein 
2009; Jones et al. 2011). Briefly, the preparation involves ammoni- 
um sulfate precipitation and ion-exchange (phosphocellulose resin) 
purification, followed by size exclusion chromatography. Fractions 
containing protein were then concentrated, aliquoted, and stored 
at — 80°C. Some of the NC was a gift of Dr. Robert Gorelick 
(NCI-Frederick) and was purified as previously described 
(Urbaneja et al. 1999). NC prepared by solid-phase synthesis and 
reconstituted with zinc (Liu et al. 2005) was also used. TARPolyA, 
Psi, and PsiC258G RNAs were in vitro transcribed from linearized 
plasmids, originally cloned from plasmid pMSMAEnv containing 
HIV-1 NL4-3 cDNA, a gift of Dr. Kathleen Boris-Lawrie (The 
Ohio State University). The PsiC258G mutant plasmid was prepared 
from the WT Psi RNA plasmid by Quik-change site-directed muta- 
genesis (Stratagene). Psi-12M RNA is derived from WT Psi RNA 
and contains the following 12 point mutations: G240,241A, 
G272.273A, G290A, U291A, G292A, G310A, G318.320A, and 
G328,329A. Psi-12M RNA was transcribed from a PCR-assembled 
template using the following DNA oligonucleotides purchased 
from Integrated DNA Technologies: 

5'-TAATACGACTCACTATAGGGACGCAAAACTCGGCTTGCT 
GA-3', 

5'-CGCAAAACTCGGCTTGCTGAAGCGCGCACGGCAAGAAAC 
GAGGGGCGGCGACTGAAAAGTACGCCAAAAATTTTT-3', 
and 5'-AAAAATTTTTGGCGTACTTTTCAGTCGCCGCCGCC 
TCGTTTCTTGCCGTGCGCGCTTCAGCAAGCCGAGTTTTG 
CG-3'. 

RNAs were fluorescently labeled with fluorescein on their 3' ends 
as previously described (Pagano et al. 2007; Jones et al. 2013). Con- 
centrations and labeling efficiencies were determined using the fol- 
lowing extinction coefficients — fluorescein, 8.5 x 10 4 M" 1 cm -1 ; 
WT Psi, 8.7 x 10 5 M" 1 cm" 1 ; PsiC258G, 8.7 x 10 5 M" 1 cm" 1 ; Psi- 
12M, 8.7 x 10 5 M" 1 cm" 1 ; TARPolyA, 9.26 x 10 5 M" 1 cm" 1 . After 
labeling, RNAs were aliquoted and stored at — 20°C. Analysis of 
RNAs by native gel electrophoresis is described in the Supplemental 
Information. 

FA equilibrium binding assays 

Direct FA-binding experiments were performed as previously 
described (Stewart-Maynard et al. 2008). Varying amounts of 
GagAp6 were incubated with 30-40 nM 3' -fluorescently labeled 
RNA in a buffer containing 20 mM HEPES (pH 7.5), 1 mM 
MgCl 2 , 10 \iM tris-2-carboxy-ethyl phosphine (TCEP), 5 mM (3- 
mercaptoethanol (BME), and varying amounts of NaCl. NC-con- 
taining reactions were identical except that they contained 20 uM 
TCEP and two equivalents of zinc acetate per NC. Reactions were 
incubated in the dark at room temperature for 30 min prior to FA 
and fluorescence intensity measurements using a SpectraMax M5 



plate reader (Molecular Devices). Fitting of the direct titration 
curves is described in the Supplemental Information. 

FA salt titration assay 

Reaction conditions were the same as in direct binding measure- 
ments, except that protein concentrations were held constant (400 
or 750 nM) and NaCl concentrations were varied from 50 mM to 
1 M. This protein concentration was chosen because the RNA bind- 
ing had reached a plateau according to equilibrium binding mea- 
surements performed at 50 mM NaCl (Supplemental Fig. SI). 
However, only the strongest protein-binding sites on RNA are satu- 
rated under these conditions (see Supplemental Information). 
Therefore, our salt titrations allow characterization and comparison 
of the highest affinity NC or Gag sites on each RNA molecule stud- 
ied. To correct for the difference in solution viscosity and/or confor- 
mational changes in RNA molecules created by the increasing NaCl 
concentration, FA values in the absence of protein were subtracted 
from the FA for protein-containing reactions in each experiment. 
The corrected data were fit to sigmoidal curves using OriginPro8 
Software (OriginLab Corporation), which yielded Na 1/2 values or 
the concentration of NaCl at which 50% protein is dissociated 
from the labeled RNA. The dissociation constant iC d (Na) varies 
with [Na] as follows: 

K d (Na) = K d(m • [Na] 2 * (1) 

Equation 1 is derived in the Supplemental Information and follows 
from the features of nonlinear screening of nucleic acid charges 
by cationic proteins or ligands (Manning 1976; Record et al. 1976; 
Rouzina and Bloomfield 1996a,b,c). Substituting equation 1 into 
the binding isotherm (Supplemental equation S4) allows fitting 
of the two basic parameters describing the salt-sensitive protein/ 
RNA binding: Kd(iM) an d Z e g, as described in the Supplemental 
Information. 

SUPPLEMENTAL MATERIAL 

Supplemental material is available for this article. 
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